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Abstract — Recent advances in wireless sensor networks (WSNs) 
have led to many new promissing applications. However data 
communication between nodes consumes a large portion of the 
total energy of WSNs. Consequently efficient data aggregation 
technique can help greatly to reduce power consumption. Data 
aggregation has emerged as a basic approach in WSNs in 
order to reduce the number of transmissions of sensor nodes 
over aggregation tree and hence minimizing the overall power 
consumption in the network. If a sensor node fails during data 
aggregation then the aggregation tree is disconnected. Hence the 
WSNs rely on in-network aggregation for efficiency but a single 
faulty node can severely influence the outcome by contributing 
an arbitrary partial aggregate value. 

In this paper we have presented a distributed algorithm that 
reconstruct the aggregation tree from the initial aggregation tree 
excluding the faulty sensor node. This is a synchronous model 
that is completed in several rounds. Our proposed scheme can 
handle multiple number of faulty nodes as well. 

I. Introduction 

A wireless sensor networks (WSNs) consist of a large num- 
ber of spatially distributed autonomous resource-constrained 
tiny sensor devices which are used to lead many new promis- 
ing applications. The applications for WSNs are varied, 
typically involving some kind of monitoring, tracking, or 
controlling. Specific applications include: Habitat monitoring, 
Object tracking, Nuclear reactor control, Fire detection, Traffic 
monitoring, etc. However data communication between nodes 
consumes a large portion of the total energy of WSNs. 
Consequently efficient data aggregation technique can help 
greatly to reduce power consumption. Data aggregation has 
emerged as a basic approach in WSNs in order to reduce the 
number of transmissions of sensor nodes over aggregation tree 
and hence minimizing the overall power consumption in the 
network. 

Depending on the application, sensor nodes either report 
each and every measurement to a gateway or sink, or they 
perform in-network aggregation: En route to the sink, nodes 
combine their own measurement with the one of other nodes 
in proximity, e.g., their children on an aggregation tree rooted 
at the sink and spanning over all sensors JTJ. A large fraction 
of WSNs requires only a periodic collection of an aggregate 
value (e.g., count, sum, average, etc.), and can do so with low 
network overhead. With in-network aggregation, rather than 
relaying individual measurements across multiple hops, each 



node transmits a single packet, summarizing the data from an 
entire area of the WSNs. 

Typically, there are three types of nodes in WSNs: leaf 
sensor nodes, aggregators, and a querier (sink) [2J. The ag- 
gregators collect data from a subset of the network, aggregate 
the data using a suitable aggregation function and then transmit 
the aggregated result to an upper aggregator or to the querier 
who generates the query. The querier is entrusted with the task 
of processing the received sensor data and derives meaningful 
information reflecting the events in the target field. It can 
be the base station or sometimes an external user who has 
permission to interact with the network depending on the 
network architecture. Data communications between sensors, 
aggregators and the queriers consume a large portion of the 
total energy consumption of the WSNs. 

Most of the works [1|-[5 | in literature focused on secure 
aggregation in WSNs. Secure aggregation means protecting 
data from attackers, where attackers intend to change the 
aggregation value and mislead the sink (or base station) 
resulting in false aggregation. They considered faulty node 
as an attacker or adversary that can compromise with sensor 
nodes by controlling their functionality and inducing arbitrary 
deviations from the protocols. But in our proposed algorithm, 
a faulty node is considered as a physical fault. 

A sensor node is called faulty, if it cannot be able to com- 
municate with any other sensor node in the WSNs. A sensor 
node may fail due to lack of battery power or some hardware 
failures. We may consider node failure as a permanent failure. 

If a sensor node fails during data aggregation then the 
aggregation tree is disconnected. Hence the WSNs rely on 
in-network aggregation for efficiency but a single faulty node 
can severely influence the outcome by contributing an arbitrary 
partial aggregate value to the sink. 

In a typical application, a WSN is scattered in a region 
where it is meant to collect data through its sensor nodes. We 
consider WSNs as a weighted communication graph, G c = 
(V, E) (say) where each sensor node is a vertex belonging to 
a set V and the communication link between two sensor nodes 
is defined as an edge belonging to a set E. Here edge weight 
is the cartesian distance between two sensor nodes. One node 
can communicate with other nodes directly if they are in its 
transmission range. 

Using some distributed minimal spanning tree (MST) al- 



gorithm (6] it is possible to construct an initial aggregation 
tree (T a ). If one node fails, then we assume, by some fault 
detection algorithm [3], that other nodes which are directly 
connected with the faulty node can detect the fault and the 
aggregation tree is decomposed into number of trees (disjoint- 
set of forest) with respect to the aggregation tree. 

Our objective in the paper is as following: Given a weighted 
communication graph G c and corresponding aggregation tree 
T a with n nodes, if one arbitrary node, Vf (say) fails then 
how to reconstruct the aggregation tree with n — 1 nodes in a 
distributed way (excluding the faulty nodes), provided the re- 
duced communication graph, G c = (V , E ) is still connected 
after removal of the faulty node, Vf where V — V\{vf} and 
E = E \ { all edges are connected with vj}. 

A. Related Work: 

Chan et al. proposed a protocol [3| where they considered 
corrupted node as a malicious aggregator node. According 
to their protocol the answer given by aggregator is a good 
approximation of the true value even when the aggregator and 
a fraction of the sensor nodes are corrupted. In the paper [5 | 
Haghani et al. considered adversary node as a misbehavior 
node that can severely influence the outcome by contributing 
an arbitrary partial aggregate value. Their scheme relies on 
costly operation to localize and exclude nodes that manipulate 
the aggreagtion when a fault is detected. Gallager et al. [6| 
proposed a distributed algorithm (distributive implementation 
of Prim's algorithm) constructing a MST of a connected graph 
in which the edge weights are unique. Their algorithm works 
on a message passing model. It uses a bottom-up approach 
and the overall message complexity of the MST algorithm 
is 0{E + nig n). In the paper [|4] Gao and Zhu proposed a 
Dual-Head Cluster Based Secure Aggregation Scheme. 

B. Our results: 

The main contribution of this paper is a distributed al- 
gorithm for reconstruction of aggregation tree in wireless 
sensor networks when an arbitrary sensor node fail during 
aggregation. To the best of our knowledge, this is the first 
distributed protocol for reconstruction of aggregation tree 
which can handle multiple concurrent permanent node failure. 
Unlike Gallager et al. [6| algorithm the edge weights of 
underline communication graph may not be unique. We have 
proved that the reconstructed aggregation tree is again a MST. 
This is a synchronous model that completes in several rounds. 
In terms of rounds the complexity of our algorithm are 0(1) 
in the best case, 0(\g n) in the worst case. The proposed 
algorithm can also handle multiple concurrent node failure. 

II. Reconstruction of Aggregation Tree 

Consider the connected WSN consisting of n sensor nodes 
(vertices). Each sensor has its unique id, a variable (initially 
zero), where edge weight is the communication cost between 
two nodes. We assume that if one node fails the communi- 
cation graph is still connected and by some fault detection 
algorithm neighbors of the faulty node can detect the fault. 



We assume at a time there is only one faulty node in the 
WSN. Our proposed algorithm is synchronous; i.e., its perform 
in several rounds. Due to failure of a node, the aggregation 
decompose in to disjoint set of forest (cluster, say). According 
to the algorithm each cluster will find the minimum outgoing 
edge (synchronously) and tries to merge with the cluster on 
the other side of the edge. This is a distributed algorithm based 
on message passing. 

A. Notations 

Following notations are used throughout the paper for 
different type of message. These message are required during 
execution of the algorithm. 

• findjmsg (Find message): Fault detective node (cluster 
root, say) initiates the message within the cluster to 
invoke the node(s) for finding moe. 

• report_msg (Report message): Every leaf node in the 
cluster sends a report_msg with moe information and 
own id to its parent after finding moe from it, and every 
intermediate node sends report_msg to its parent after 
getting information about the moe of its subtree including 
itself. 

« test_msg (Test message): A node issue a test_msg 
message through the moe to know whether this edge is 
going to some other cluster. 

• accept_msg (Accept message): A node generates a 
accept_msg message after receiving test_msg message 
if the test_msg message sender is belonging to different 
cluster. 

• reject_msg (Reject message): A node generates a 
reject_msg message after receiving test_msg message 
if the test_msg message sender is belonging to the same 
cluster. 

« inform_msg (Inform message): cluster root sends this 
message to the node in which the moe is attached. 

> merge_req (Merge Request): Merging request from one 
cluster to some other cluster, containing cluster id. 

m internal _msg (Internal message): This message is for 
pass the information in the same cluster. 

• merge_msg (Merge message): To ensure merging be- 
tween two cluster. 

• commit_msg (Commit message): Final commitment 

• ignorejmsg (Ignore message): Ignore requests. 

• modify_msg (Modify message): This message is gener- 
ated by the end points of minimum outgoing edge after 
merging and pass in the new cluster to find the new root. 

III. Description of the Algorithm 

Suppose a sensor node with degree k is faulty in the initial 
aggregation tree T a . Removal of this faulty node decomposes 
the aggregation tree into k number of trees (or clusters), 
Ti, T2, ■ ■ ■ ,Tk (say). Then let us assume by some fault detec- 
tion algorithm the node, vf {root of the cluster, say) directly 
attached with the faulty node in each cluster, Ti can find the 
information about the fault and starts following reconstruction 
process. 



A. Subround-I: Minimum outgoing edge (moe) finding 

For each cluster Tj, vf named as root node initiates and 
sends find_msg to its descenders within the cluster through 
the tree edges with the id of the root, named as T\ d , which is 
same as vf. After receiving findjmsg every other nodes 
assign T\ d to its local variable (cluster jd) and forwards 
the message to neighbors until it reach to leaf nodes. After 
receiving findjmsg leaf node finds the moe and returns 
a report_msg to the sender of findjmsg. After receiving 
report jmsg all intermediate nodes modify moe if possible 
with respect to its own moe and forward the report jmsg to 
the root node. For finding moe a node passes test jmsg with 
cluster jd through the possible moe to test whether the other 
end of this moe is in the different cluster. If the other end of 
moe is in different cluster than the node returns a accept jmsg 
with its own id otherwise the node returns a reject jmsg. 

After receiving reject jmsg this node again tries to find 
the next possible moe among its neighbours until it receive a 
accept jmsg or there is no possible moe edge for node. In that 
case the node marks all such rejected edges not to use further 
for moe selection. There may be a possibility of multiple moe 
at any individual node. In this case the node selects moe with 
minimum id node among the multiple accept jmsg. 

After receiving report jmsg the root node finally selects 
a moe for the cluster and sends in form jmsg to the corre- 
sponding node w ™ oe (say) attached with the moe. 

B. Subround-II: Merge message passing 

The node, v™ oe of each cluster, Tj sends a mergejreq 
message along their respective moe to some node of Tj, say. 
The decision after receiving mergejreq message as following: 
There are two cases: 

1) If root, vf of Tj receives mergejreq and if the 
clusterjid of Tj is less than the cluster jd of Tj then 
vf returns an ignorejmsg to v™- oe , otherwise vf keep 
the information in its database. 

2) If some other node (vj) excluding vf of Tj receives 
a mergejreq and if cluster jd of Tj is less than 
the cluster jd of Tj then the node Vj returns an 
ignorejmsg to w™ oe , otherwise Vj forwards the mes- 
sage (internal jmsg) to the root vf. 

C. Subround-III: Decision after receiving a merge messages 

At the end of the previous Subround-II if root of Tj for 
some j receives one or more than one mergejreq messages 
then it finds the minimum cluster jd over all messages and 
sends a mergejmsg to the minimum id cluster and sends 
ignorejmsg to all others directly or via Vj node (Vj is 
considered in the case-2 of Subround-II). Now, if root of Tj 
for some j does not receive any mergejreq or receive but 
pass a ignorejmsg to sender then the root of Tj sends a 
mergejmsg through the moe (chosen in Subround-II) from 

v rnoe no( j e 



D. Subround-IV: Merging of clusters 

In this subround each cluster Tj, for i = 1, 2, • ■ • , k 
some node Vi (including root) receives mergejmsg and/or 
ignorejmsg from Vj (including root) of some other cluster 
Tj. If the message is ignorejmsg then drop the message. 
Otherwise merge these two clusters in the following ways: 

1) If Vi sends a mergejmsg to Vj and if T\ d < T- d then 
Tj sends a commit jmsg to Tj and Tj merge with Tj 
by including the edge in the modified aggregation tree. 
After that the vertices attached with the edge initiate 
modify jmsg over the new cluster T^ (, say) with the 
information of vf for the modification of root. If Vi 
sends a mergejmsg to Vj and if T- d > Tj d then 
mergejmsg is drop without merging. 

2) If Vi does not send a mergejmsg to Tj then Vi sends 
a commit jmsg and a modify jmsg (as a responds) to 
cluster Tj after receiving modify jmsg from its own 
cluster. Then Tj merge with T by including the edge in 
the modified aggregation tree and Tj expand. 

IV. The Algorithm 

G c = (V, E) <— Communication graph 

T a <— Initial aggregation tree 

k <— Degree of the faulty node, Vf 

Subround-I : (Finding moe) 

for each cluster Tj ; i = 1 to k do 

root, vf initiates and sends < find jmsg, T\ d > 

for each node Vi do 
cluster jdi T id 

end for 

for each node «j (starts from leaf nodes) do 

passes test jmsg through its moe EE of G c to some 

other node vy 

if T id ^ Tf, d then 

«j/ returns an accept jmsg to Vi 

Vi passes a reprotjmsg to the findjmsg sender 
else 

vy returns reject jmsg to v j and marks this rejected 
edge in E and Vi looks for the next possible moe 
end if 
end for 

for each node «j (intermediate/root) do 

After receiving report jmsg the node modifies moe 
if possible wrt its own moe as above and forwards 
reportjmsg to its ancestor until it reaches to the root 

When root receives the v" loe then it passes the 
inform jmsg to the «™ oe if moe is not attached with 
the root 
end for 
end for 

if there is no moe then 

return Tree is reconstructed & the protocol is terminated 
else 

moves for the Subround-II 



end if 

Subround-11 : (Merge message passing) 
for each cluster Tj ; i = 1 to k do 

y moe sen( j s a mer g e _ r eq from cluster Tj to some Tj 
end for 

if Vj of Tj receives this merge_req message then 
if Cf < Cf then 

passes an ignorejmsg to v™ oe 
else 

keeps the message 
end if 
else 

if some other node vj of Tj receives this merge_req 
message then 

if Cf < C\ d then 

passes an ignorejmsg to v™' oe 
else 

Vj receives this message and passes it to Vj of Tj 
through an internal jmsg 
end if 
end if 
end if 

Subround-III : (Decision after receiving a merge messages) 
for each cluster T ; i = 1 to k do 

if vf receives mergejreq from some other clusters then 
sends mergejmsg to the minimum id cluster among 
them and ignore_msg to others 
else 

sends mergejmsg from v™- oe through moe 
end if 
end for 

Subround-IV : (Merging of clusters) 

v l of T receives either merge_msg or/and ignorejmsg 
from Tj after the end of Subround-III 
if message is ignorejmsg then 

drops the message without merging 
else 

if T also sends a mergejmsg to Tj then 

if T\ d < T] d then 

Tj passes a commit jmsg to Tj 

Tj merges with Tj in some new cluster T i 

the nodes attached with merged edge initiates and 

sends modify jmsg within T i . 

clustered of w <E T i resets the value by vf 

else 

drops this received mergejmsg 
end if 
else 

v l sends a commit jmsg and forwards modify jmsg 
(as a responds) to cluster Tj after receiving 
modify jmsg from its own cluster and then Tj merges 
with Tj 
end if 
end if 



Re-execute the protocol from Subround-I with modified 
clusters until termination. 

V. Complexity Analysis 

Let k be the number of clusters after a node failure. We are 
measuring the complexity of the proposed algorithm in terms 
of rounds of execution and total number of message exchange. 
First we concentrate over possible best and worst rounds of 
execution. 

• Case-1 (Best Case) If «™ oe sends mergejreq to the min- 
imum id cluster Tj (,say) for all i e {1, 2, • • • , k} \ {j}, 
then the tree would be reconstructed in one round. 

• Case-2 (Worst Case) If every distinct pair of clus- 
ters exchange merge_req in Subround-II and merge in 
Subround-IV then in one round number of cluster reduces 
by half. If this kind of merging process is continue then 
after 0(lg k) rounds the tree would be reconstructed. 

Now we determine an upper bound for the number of messages 
for a cluster Tj. 

Let the number of nodes in this cluster is rij. Recall the 
types of messages used by the algorithm : 
findjmsg: n» — 1 findjmsg messages. 
test jmsg: (successful test and failed test.) 
accept jmsg: Acceptance requires two messages, success- 
ful test and accept. So the messages are 2nj. Note that 
inform jmsg also included in this count. 
reject jmsg: Note that an edge can be reject at most once 
throughout the execution of the algorithm. Rejection requires 
two messages: failed test and reject. So we have 2E messages. 
reportjrnsg: n, — 1 report_msg. 
merge_req: 1 (one) request for merging. 
ignorejmsg: at most k — 1 ignorejmsg throughout the 
execution of the algorithm. 
internal jmsg: at most m — 1 message. 
mergejmsg: one message. 

commit jmsg: one message for final commitment 
modify jmsg: rij — 1 messages for modification. 

The total number of message required for a cluster is 6rtj. 
Total number of message for all k clusters is J2i=i(^ n i) = 
6(n — 1) where n — 1 = X^j=i n i 

Therefore the total number of message for merging of all 
k clusters is 0(n\g k + E). Here k may be n — 1, therefore 
the total counting brings us to 0(n\g n + E). 

VI. Correctness 

Note that in a single round of proposed algorithm, every 
cluster sends a unique mergejmsg through moe. In the 
merging of two or more than two clusters simultaneously there 
is exactly two clusters which sends a mergejmsg to each 
other through the same moe. 

Theorem 1: There is no cycle after merging two or more 
clusters. 

Proof: Let T be the initial aggregation tree with n nodes 
and Vf be the faulty node. Proof by induction on degree of Vf 
node in T a . 



Basis: Let deg(w/) = 1. Then after removing Vf from T a , there 
is only one cluster with n — 1 nodes. Clearly T a with n — 1 
nodes is again a tree. 

Let deg(w/) = 2 and T^, 7} be the clusters. Let us suppose 
cycle occurs in the merging of Ti and Tj. It is possible if 
both Ti and Tj send a merge_msg to each other through 
different multiple moe. But this contradicts Subround-III of 
the proposed algorithm. Since according to proposed algorithm 
both Ti and Tj send a merge_msg to each other through same 
moe. Hence there is no cycle in the merging of Ti and Tj. 
Inductive hypothesis: Let no cycle occurs in the merging of k 
or less clusters, i.e., deg(u/) < k. 

Inductive step: Now let deg(w/) = k + 1 and T, for i = 
1, 2, • • • , k + 1 be the clusters. Let us suppose cycle occurs in 
the merging of these k + 1 clusters. It is possible if at least 
three cluster T\, T 2 , T 3 (, say) send the merge_msg to each 
other as T x to T 2 , T 2 to T 3 , T 3 to 7\ in a round. But this 
contradicts our algorithm that there are exactly two clusters 
which send a merge_msg to each other through the same 
moe in the merging of more than two clusters. Therefore cycle 
cannot occur in a round and number of clusters reduces. Now 
by inductive hypothesis cycle will not occur in the merging 
of k + 1 clusters. Hence theorem is true for any number of 
clusters. ■ 
Theorem 2: Resultant reconstructed aggregation tree is 
again a MST 

Proof: Let T a be the initial aggregation tree and given 
that is a MST with n nodes and v/ be the faulty node 
with degree k. Let T a be the aggregation tree which is 
reconstructed using our proposed algorithm with n — 1 nodes 
after removing the faulty node Vf . Since T a is a MST, 
therefore removal of vj divides it in to k sub trees where 
each of them are individually a MST. Now suppose T a is not 
a MST, it means there are at least two clusters which is not 
merged with a minimum weighted edge in the T a . But it is 
a contradiction of our algorithm that allows merging between 
different clusters through a minimal weighted edge. Hence the 
resultant reconstructed aggregation tree is again a MST. ■ 

VII. Multiple Sensor Nodes Failure 

If m number of nodes fail simultaneously and if 
di, e?2, • • • , d m are the degrees of respective faulty nodes then 

at most di+d 2 -\ \-d m number of disjoint forest may form. 

Then same proposed algorithm can merge all disjoint forest 
and reconstruct the aggregation tree. 

VIII. Conclusion 

In this paper, we have proposed a distributed algorithm for 
reconstruction of aggregation tree in wireless sensor networks 
when an arbitrary sensor node fails during aggregation. Our 
model is synchronous, performing in rounds. In terms of 
rounds the time complexity of our algorithm is 0(1) in the 
best case, 0(lg n) in the worst case. Our proposed algorithm 
can also handel multiple concurrent sensor node failure. But 
the proposed algorithm cannot handel node failure during 
the reconstruction phase. In our future works we will try to 



incorporate node failure during the reconstruction phase as 
well. 
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